A state-tying approach to building syllable HMMs
نویسندگان
چکیده
A severe sparse data problem is faced when building HMMs of syllable units due to the uneven distribution of syllables in natural speech. In this paper we present a novel approach to building syllable HMMs which attempts to overcome this problem. The method involves tying the states in syllable models using a bottom-up clustering algorithm and a state similarity measure employing phonetic information. Experiments using the new approach on the TIMIT database show that it improves the recognition accuracy of syllable HMMs. We present encouraging results which show that when the state-tying method is used in conjunction with a MultiModel [11] approach to acoustic modeling a syllable identification accuracy of 53.4% can be achieved. This equates to a phoneme accuracy of 72.8% which is comparable with the best results achieved using triphone HMMs.
منابع مشابه
A discriminative decision tree learning approach to acoustic modeling
The decision tree is a popular method to accomplish tying of the states of a set context dependent phone HMMs for efficient and effective training of the large acoustic models. A likelihood-based impurity function is commonly adopted. It is well known that maximizing likelihood does not result in the maximal separation between the distributions in the leaves of the tree. To improve robustness, ...
متن کاملAsynchronous-transition HMM
We propose a new class of hidden Markov model (HMM) called asynchronous-transition HMM (AT-HMM). Opposed to conventional HMMs where hidden state transition occurs simultaneously to all features, the new class of HMM allows state transitions asynchronized between individual features to better model asynchronous timings of acoustic feature changes. In this paper, we focus on a particular class of...
متن کاملSoft state-tying for HMM-based speech recognition
This paper introduces a method for regularization of HMM systems that avoids parameteroverfitting causedby insufficient training data. Regularization is done by augmenting the EM training method by a penalty term that favors simple and smooth HMM systems. The penalty term is constructed as a mixture model of negative exponential distributions that is assumed to generate the state dependent emis...
متن کاملSpeaker adaptation using tree structured shared-state HMMs
This paper proposes a novel speaker adaptation method that exibly controls state-sharing of HMMs according to the amount of adaptation data. In our scheme, acoustic modeling is combined with adaptation to e ciently utilize the acoustic models sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history rec...
متن کاملComparison among Various Question Generations for Decision Tree Based State Tying in Persian Language
Performance of any continuous speech recognition system is highly dependent on performance of the acoustic models. Generally, development of the robust spoken language technology relies on the availability of large amounts of data. Common way to cope with little data for training each state of Markov models is treebased state tying. This tying method applies contextual questions to tie states. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002